LISTING OF CLAIMS

1. (Currently Amended) A computer implemented method of generating context vectors
representing information items for retrieval of the information items or
records containing the information items, the method comprising:

assigning a context vector to each of a plurality of information items;

initializing the context vectors such that the context vectors are substantially orthogonal 
to each other in a vector space; 

determining proximal co-occurrences of the information items; and

adjusting the context vectors based on the proximal co-occurrences of the information 
items, such that the information items that frequently proximally
co-occur have context vectors with similar orientations in the vector space. 
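
As an illustration only, the determining step of claim 1 might be sketched as counting, for each target item, the items that fall within a fixed window around it. The function name `proximal_cooccurrences`, the `window` parameter, and the symmetric window shape are assumptions made for this sketch; the claim does not recite them.

```python
from collections import Counter


def proximal_cooccurrences(tokens, window=2):
    """Count ordered (target, neighbor) pairs that occur within `window` positions.

    `tokens` is the sequence of information items in one record. The symmetric
    fixed-size window is an assumption of this sketch.
    """
    counts = Counter()
    for i, target in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # an item is not its own neighbor
                counts[(target, tokens[j])] += 1
    return counts
```

Pairs that never fall inside a common window simply have a count of zero, so they contribute nothing to a later adjusting step.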

2. (Original) The method of claim 1, wherein initializing the context vectors further
comprises: 

assigning vector components to the vectors using zero-mean, unit-variance Gaussian 
random number generation. 
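
A minimal sketch of the initialization recited in claim 2, assuming a dictionary keyed by item and a caller-chosen dimension. The names `init_context_vectors` and `cosine`, the seed, and the dimension of 300 are hypothetical. In a high-dimensional space, independently drawn Gaussian vectors tend to have a cosine near zero, which is one way to obtain substantially orthogonal starting vectors.

```python
import math
import random


def init_context_vectors(items, dim, seed=0):
    """Give each item a vector of zero-mean, unit-variance Gaussian components."""
    rng = random.Random(seed)
    return {item: [rng.gauss(0.0, 1.0) for _ in range(dim)] for item in items}


def cosine(u, v):
    """Cosine of the angle between two vectors (0.0 if either has zero length)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


vectors = init_context_vectors(["bank", "river", "money"], dim=300)
# A cosine close to zero indicates the two vectors are nearly orthogonal.
near_zero = cosine(vectors["bank"], vectors["river"])
```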

3. (Currently Amended) The method of claim 1, wherein a target context vector is a
context vector assigned to a target information item and a neighbor context vector is a
context vector assigned to an information item that proximally co-occurs with the target
context vector, and wherein adjusting the context vectors comprises: 

for each target context vector to be adjusted: 

determining an error vector between a target context vector and each
neighbor context vector;
updating the target context vector as a function of the error vectors.
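
The two steps of claim 3 can be sketched as follows. The claim leaves the "function of the error vectors" open, so the plain scaled sum used here, the name `update_target`, and the `step_size` parameter are assumptions of this sketch rather than the claimed update.

```python
def update_target(target, neighbors, step_size=0.1):
    """Move `target` toward its neighbors using error vectors (neighbor - target).

    The scaled sum of error vectors is an assumed choice for this sketch.
    """
    if not neighbors:
        return list(target)  # nothing to learn from
    # One error vector per neighbor context vector.
    errors = [[n_k - t_k for n_k, t_k in zip(n, target)] for n in neighbors]
    # Sum the error vectors component by component.
    correction = [sum(col) for col in zip(*errors)]
    return [t_k + step_size * c_k for t_k, c_k in zip(target, correction)]
```

With a small step size the target vector drifts toward the neighbors it co-occurs with, which is the behavior the adjusting step of claim 1 calls for.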

4. (Currently Amended) The method of claim 3, wherein updating the target context
vector as a function of the error vectors comprises:

determining a correction vector from the error vectors, where the correction vector is: 

WS 

where: 

$E_{ij}$ is the error vector $E_{ij} = N_{ij} - T_j$ between the neighbor context vector $N_{ij}$ and
a target context vector $T_j$;
WS is a window size containing the target context vector and the neighbor
context vectors; and
q is a proximity constraint; and 
updating the target context vector as 

J J F J F=I - 

where: 

Y is a step size; 

$F_j$ is the total number of occurrences of information item j; and
M is a mean context vector for all unique context vectors.

5. (Currently Amended) The method of claim 1 wherein a target context vector is a
context vector assigned to a target information item, and a neighbor context vector is a
context vector assigned to an information item that proximally co-occurs with the target
context vector, and wherein adjusting the context vectors comprises: 

determining a weighted sum vector of neighbor context vectors of a target context 
vector; 

applying the weighted sum vector to the target context vector.

6. (Currently Amended) The method of claim 5, further comprising:
determining the weighted sum vector according to the equation: 



where: 

W is the weighted sum vector; 

$N_{ij}$ is the neighbor context vector to target context vector $T_j$;

G(i) is a Gaussian weight for the neighbor context vector i; and

$D_j$ is the number of documents that contain target information item j;

and

applying the weighted sum vector to the target context vector according to the equation:
$T_j^{NEW} = T_j^{OLD} + W_j$

where:

$T_j^{NEW}$ is the updated target context vector; and
$T_j^{OLD}$ is the un-updated target context vector.



7. (Currently Amended) The method of claim 1 wherein a target context vector is a
context vector assigned to a target information item, and a neighbor context vector is a
context vector assigned to an information item that proximally co-occurs with the target
context vector, and wherein adjusting the context vectors comprises: 

determining a weighted sum vector of neighbor context vectors of a target context vector 
according to the equation 

where:

W is the weighted sum vector; 

G(i) is a Gaussian weight for the neighbor context vector i; and 

$D_j$ is the number of documents that contain target information item j;

determining an error vector from the weighted sum vector and the target context vector: 
where 

E is the error vector; 
T is the target context vector;
determining a correction vector C from the error vectors of the neighbor context vectors: 

applying the correction vector to the target vector:

where: 

$T^{NEW}$ is the updated target context vector;
$T^{OLD}$ is the un-updated target context vector;
Y is a step size; and

M is a mean context vector for all unique context vectors. 



8. (Original) The method of claim 1, wherein the vector space is defined by a plurality of
axes, and the context vectors include vector components corresponding to the respective axes,
and wherein the axes individually do not have specific semantic associations. 

9. (Currently Amended) The method of claim 1, wherein a target context vector is a
context vector assigned to a target information item, and a neighbor context vector is a
context vector assigned to an information item that proximally co-occurs with the target
context vector, and wherein adjusting the context vectors comprises: 

adjusting the target context vector as a function of the relative importance of at least one 
of either the target information element or the neighbor context vector with respect to 
the plurality of information items.

10. (Currently Amended) The method of claim 1, wherein a target context vector is a
context vector assigned to a target information item, and a neighbor context vector is a
context vector assigned to an information item that proximally co-occurs with the target
context vector, and wherein adjusting the context vectors comprises: 

adjusting the target context vector as a function of the frequency of occurrence of the 
target information item, the frequency of occurrence of the neighbor
information item, a total number of records containing the target information
item, and a total number of records containing the neighbor information
item.

11. (Currently Amended) The method of claim 1, wherein a target context vector is a
context vector assigned to a target information item, and a neighbor context vector is a
context vector assigned to an information item that proximally co-occurs with the target
context vector, and wherein adjusting the context vectors further comprises:

adjusting the target context vector as a function of: 

a distance of the neighbor information item from the target
information item, so that neighbor information items
that are closer to the target information item cause the target
context vector to be adjusted to be closer to the neighbor context vector;
and

a frequency of occurrence of the neighbor information item in
records containing the information items, such that a neighbor
information item that is less frequently occurring more strongly
causes the target context vector to be adjusted to be closer to that
neighbor context vector than a neighbor information item that is
more frequently occurring.

12. (Currently Amended) The method of claim 1, further comprising: 

for at least one record comprising a plurality of information items, determining
a summary context vector for the record from the normalized sum of the context
vectors of the information items that comprise the record.
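
The normalized sum of claim 12 can be sketched as a component-wise sum scaled to unit length. The function name `summary_vector` and the choice of the Euclidean norm for normalization are assumptions of this sketch.

```python
import math


def summary_vector(item_vectors):
    """Sum the context vectors of a record's items and scale the result to unit length."""
    total = [sum(col) for col in zip(*item_vectors)]
    norm = math.sqrt(sum(x * x for x in total))
    # A zero-length sum cannot be normalized; return it unchanged in that case.
    return [x / norm for x in total] if norm > 0 else total
```

Because the result has unit length, records of different sizes can be compared by orientation alone.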

13. (Currently Amended) The method of claim 12, further comprising: 
receiving a query comprising at least one information item;

generating a query context vector from the information items that comprise the
query;

retrieving at least one record having a summary context vector with an orientation in the
vector space similar to the orientation of the query context vector. 
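
One way to picture the retrieval of claim 13 is to rank records by the cosine between the query context vector and each summary context vector. This is a sketch under assumptions: the names `retrieve`, `item_vectors`, `record_summaries`, and `top_k` are hypothetical, the query vector is formed as a plain sum, and cosine is used as the measure of similar orientation.

```python
import math


def cosine(u, v):
    """Cosine of the angle between two vectors (0.0 if either has zero length)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def retrieve(query_items, item_vectors, record_summaries, top_k=1):
    """Return ids of the `top_k` records whose summary vectors best match the query."""
    known = [item_vectors[i] for i in query_items if i in item_vectors]
    if not known:
        return []  # no query item has a context vector
    # Query context vector: component-wise sum of the query items' vectors.
    query_vec = [sum(col) for col in zip(*known)]
    ranked = sorted(
        record_summaries,
        key=lambda rid: cosine(query_vec, record_summaries[rid]),
        reverse=True,
    )
    return ranked[:top_k]
```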



14. (Original) The method of claim 13, wherein retrieving at least one record having a
summary context vector with an orientation in the vector space similar to the orientation of the
query context vector further comprises:

a tree walk of a cluster tree, the cluster tree comprising a hierarchical plurality of nodes, 
each node having a cluster centroid vector, each cluster centroid vector associated 
with a cluster of one or more records, and derived from the one or more records 
contained in the cluster, the tree walk performed by iteratively selecting a node of
the cluster tree that has a centroid cluster vector with a closest orientation in the 
vector space to the query context vector. 
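
The tree walk of claim 14 might look like the greedy descent below. The `ClusterNode` class, its field names, and the use of cosine to judge the closest orientation are assumptions of this sketch; how the cluster tree is built is not shown.

```python
import math


class ClusterNode:
    """A node of a hypothetical cluster tree: a centroid plus children or records."""

    def __init__(self, centroid, children=None, records=None):
        self.centroid = centroid
        self.children = children or []
        self.records = records or []


def cosine(u, v):
    """Cosine of the angle between two vectors (0.0 if either has zero length)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def tree_walk(root, query_vec):
    """Descend from the root, at each level choosing the child whose centroid
    is closest in orientation to the query, and return the leaf's records."""
    node = root
    while node.children:
        node = max(node.children, key=lambda child: cosine(child.centroid, query_vec))
    return node.records
```

The walk compares the query against one level of centroids at a time, so it inspects far fewer vectors than a scan over every record.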

15. (Original) The method of claim 1, further comprising:

clustering the context vectors into a plurality of clusters, each cluster having a centroid
vector derived from the plurality of context vectors contained in the cluster.

16. (Currently Amended) The method of claim 1, wherein: 

information items having similar meaning have context vectors with similar
orientations in the vector space. 

17. (Currently Amended) The method of claim 1, further comprising:
determining a similarity of meaning between a first information item and a
second information item by performing a vector operation on the context vectors of
the first and second information items.

18. (Currently Amended) A computer implemented method of generating vectors
representing information items for retrieval of the information items, the method
comprising:

selecting a set of R information items;

determining for the selected set of information elements an RxR mutual co-occurrence
matrix based on proximal co-occurrences of the information items in a
plurality of documents;
applying Singular Value Decomposition to the mutual co-occurrence matrix to produce a
set of first context vectors, the first context vectors having orientations in a D
dimensional vector space, where D<<R; and
wherein each first context vector is uniquely associated with one of the selected
information items, and wherein information items having similar
meaning have respective first context vectors with similar orientations in the vector
space.

19. (Currently Amended) The method of claim 18, wherein the mutual co-occurrence
matrix comprises for each pair of selected information items, a normalized measure of
the frequency of proximal co-occurrence of the pair of selected information items.

20. (Currently Amended) The method of claim 18, wherein the selected set of
information items comprises a first selected set, the method further comprising:

selecting a second set of information items different from the first selected set
of information items;
associating each of the second set of information items with a second context
vector; and

updating the second context vectors at least once using the first context vectors, wherein 
the first context vectors are fixed during at least one update of the second context 
vectors. 



21. (Currently Amended) A computer implemented method of retrieving a record from a
database containing a plurality of records, each record containing at least one information
item having an associated context vector, the method comprising:

for each of a plurality of information items, storing context vectors
associated with the information item, the context vectors having the
properties that information items having similar meaning have context
vectors with similar orientations in a vector space, and information items
having dissimilar meanings have context vectors with dissimilar orientations in the 
vector space; 

for each of the plurality of records, storing a summary context vector derived from 
context vectors respectively associated with information items that
comprise the record; 

receiving a query; 

deriving at least one query information item from the query;
generating a query context vector from the query information item; and
selecting at least one record having a summary context vector with orientation in the 
vector space that is similar to the orientation of the query context vector. 

22. (Original) The method of claim 21, wherein selecting at least one record further
comprises: 

for each of a plurality of records, determining a distance in the vector space between the 
query context vector and a summary context vector of a record; and 

selecting the record having a least distance between its summary context vector and the
query context vector. 
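
The selection of claim 22 can be sketched as a minimum over per-record distances. Euclidean distance is an assumption here, since the claim says only "a distance in the vector space"; the names `nearest_record` and `record_summaries` are likewise hypothetical.

```python
import math


def nearest_record(query_vec, record_summaries):
    """Return the id of the record whose summary vector is closest to the query.

    Euclidean distance is an assumed metric for this sketch.
    """

    def distance(vec):
        return math.sqrt(sum((q - r) ** 2 for q, r in zip(query_vec, vec)))

    return min(record_summaries, key=lambda rid: distance(record_summaries[rid]))
```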

23. (Currently Amended) A computer implemented method of providing a universal meaning
space for human understandable information items, the method comprising:

selecting a set of first information items;

creating a first set of context vectors based on proximal co-occurrences of the first
information items in a corpus of records, each first context vector uniquely
associated with one of the first information items, the context vectors
having an orientation in a vector space, such that first information items
having similar meaning have context vectors with similar orientations in the vector
space;

selecting a set of second information items, the second information
items being different from the first information items;
selecting a subset of the first information items;

for each first information item in the subset, selecting a corresponding second
information item having a human understandable meaning substantially
identical to the meaning of the first information item;
for each of the selected second information items, associating the second
information item with the context vector of the corresponding first
information item;
assigning a context vector to each non-selected second information item; and
adjusting the context vectors of the non-selected second information items
using the context vectors of the selected second information items.

24. (Currently Amended) The method of claim 23, wherein:

the first information items are words of a first human language, and the second
information items are words of a second, different human language; and

the subset of first information items and the corresponding second information
items have substantially identical meaning.

25. (Currently Amended) The method of claim 23, wherein:

the first information items are symbolic representations of words of a human
language encoded in a first data format, and the second information items
are symbolic representations of non-text data encoded in a second data format
different from the first format; and

the subset of first information items and the corresponding second information
items have substantially related meaning even though they have different
data formats and different symbolic representations.

26. (Currently Amended) The method of claim 23, wherein adjusting the context vectors
of the non-selected second information items further comprises:

adjusting the context vectors of the non-selected second information items,
such that non-selected second information items and selected second
information items having similar meaning have context vectors with similar
orientations in the vector space.

27. (Currently Amended) A computer-implemented process of generating a dictionary of
information items for a database of records, each record including at least one
information item, each information item associated with a context vector, each
information item having a determinate proximity to other information items in
a record, wherein a neighbor information item is an information item that occurs
proximate a target information item in at least one record in the database, the method
comprising:

initializing the context vectors associated with information items in the
dictionary, such that initial context vectors are substantially orthogonal to each other
in a vector space;

for each information item being a target information item:

selecting neighbor information items of the target information
item in at least one record;

modifying the context vector of the target information item using the
context vectors of each selected neighbor information item as
a function of the proximity of each neighbor information item to
the target information item, and a co-importance of the target
information item and the neighbor information item.

28. The method of claim 27, further comprising: 

determining the co-importance according to the relative importance of the target 
information item and the relative importance of the neighbor information
item.

29. The method of claim 28, wherein determining the co-importance comprises: 
determining a first relative importance of the target information item, inversely
according to the frequency of occurrence of the target information item in
the records;

determining a second relative importance of the neighbor information item
inversely according to the frequency of occurrence of the neighbor information
item in records; and

determining the co-importance as a function of the first relative importance and the 
second relative importance. 

30. (Currently Amended) The method of claim 29, further comprising:
determining the relative importance of an information item by the equation:
ND .

f,= fl+(1-S)(i- r-H 

!og{ ) 

TND 

wherein: 

$I_J$ is the relative importance of information item J;
B represents a predefined lower bound;

$ND_J$ represents the number of records containing information item J;
and 

TND represents the total number of records. 

31. (Currently Amended) The method of claim 27, wherein the co-importance of the
target information item and the neighbor information item is determined using
the equations:

1) $C_{tn} = I_t I_n$

wherein:

$C_{tn}$ is the co-importance of the target information item and the
neighbor information item;
$I_t$ is the relative importance of the target information item; and
$I_n$ is the relative importance of the neighbor information item; and

ND . 

2)ij= B + (1- 5)<1- -J- )f 

log(— — ) 
TND 

wherein: 

$I_J$ is the relative importance of an information item J;
B represents a predefined lower bound;

$ND_J$ represents the number of records containing information item J;
and
TND represents the total number of records.

32. (Currently Amended) The method of claim 27, wherein a proximity constraint varies
a magnitude of the modification to the context vector of the target information item as a
function of both the frequency of occurrence of the target information item and the
frequency of the occurrence of each neighbor information item in the records, so that
the context vectors of information items that frequently proximally co-occur do not
converge.

33. In a computer system including a storage device containing a plurality of records, 
each record containing a plurality of information items, a computer readable medium
for configuring and controlling the computer system to generate a plurality of context vectors,
the computer readable medium comprising:

an initial context vector generation module, adapted to read and write to the storage
device, which initializes to each of a plurality of selected information items an
initial context vector, such that the initial context vectors are substantially orthogonal
to each other in a vector space, and which writes the initial context vectors to the
storage device in association with respective information items;

a vector training module, adapted to read and write to the storage device, for modifying
the context vector of a selected information item, being a target information
item, using the context vectors of neighbor information items that
proximally co-occur with the target information item, as a function of the
proximity of each neighbor information item to the target information
item, and a co-importance of the target information item and the neighbor
information item.



34. (Original) A method of automatically indexing documents using a defined index of
terms, the method comprising:

providing an indexed collection of documents, each document having at least one index
term assigned to the document;

providing a plurality of terms, including the index terms, each term associated with a
context vector, the context vector having the properties that terms having similar
meaning have context vectors with similar orientations in a vector space, terms 
having dissimilar meanings have context vectors with dissimilar orientations in the 
vector space, and terms which frequently proximally co-occur have context vectors 
with similar orientations in the vector space; and 

generating for each indexed document a context vector from the context vectors of 
selected terms that comprise the document; 

receiving a new document to be indexed; 

generating a new context vector of the new document, the new context vector generated 
from the context vectors of selected terms that comprise the new document; 

selecting at least one indexed document having a context vector similar to the new 
context vector; 

assigning to the new document at least one index term assigned to a selected indexed 
document. 

35. (Original) The method of claim 34, wherein assigning to the new document at least
one index term further comprises: 

for each selected indexed document, assigning a weight to each index term assigned to 
the indexed document, the weight proportional to the similarity between the new 
context vector and the context vector of the indexed document, such that the weight 
is higher where the context vectors are more similar; 

for each index term, generating an index term score as a function of a number of
occurrences of the index term in each selected indexed document, and the weight of
the index term with respect to each selected indexed document, such that the index 
term score is higher as the number of occurrences of an index term increases; and 

assigning to the new document at least one of the index terms with a high index term 
score. 
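
The scoring of claim 35 could be sketched as below. The claim requires only that the weight grow with similarity and that the score grow with the number of occurrences, so the specific choice made here (score as the sum of similarity times occurrence count) is an assumption, as are the names `score_index_terms` and `top_terms` and the input layout.

```python
from collections import defaultdict


def score_index_terms(selected):
    """Score index terms from the selected indexed documents.

    `selected` is a list of (similarity, {index_term: occurrence_count}) pairs,
    one pair per selected indexed document. The product form is an assumption.
    """
    scores = defaultdict(float)
    for similarity, term_counts in selected:
        for term, count in term_counts.items():
            # Weight is proportional to similarity; score grows with occurrences.
            scores[term] += similarity * count
    return dict(scores)


def top_terms(scores, k):
    """Return the `k` index terms with the highest scores."""
    return sorted(scores, key=scores.get, reverse=True)[:k]
```

A new document would then be assigned the terms returned by `top_terms`, so that terms drawn from the most similar indexed documents dominate.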